Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
Según | 3231 | 99 | 2 | 49.5000 |
Pero | 3904 | 124 | 3 | 41.3333 |
También | 2180 | 92 | 3 | 30.6667 |
1.00 | 1038 | 29 | 1 | 29.0000 |
cuales | 1340 | 54 | 2 | 27.0000 |
pues | 2203 | 96 | 4 | 24.0000 |
Su | 1856 | 160 | 7 | 22.8571 |
Esto | 1483 | 68 | 3 | 22.6667 |
Fue | 830 | 45 | 2 | 22.5000 |
Otro | 459 | 21 | 1 | 21.0000 |
Aunque | 1005 | 38 | 2 | 19.0000 |
Eso | 820 | 37 | 2 | 18.5000 |
Desde | 1188 | 51 | 3 | 17.0000 |
Actualmente | 361 | 17 | 1 | 17.0000 |
visualizarla | 312 | 17 | 1 | 17.0000 |
Estos | 694 | 51 | 3 | 17.0000 |
Ese | 520 | 30 | 2 | 15.0000 |
Además | 2650 | 15 | 1 | 15.0000 |
Rica | 639 | 15 | 1 | 15.0000 |
Es | 5207 | 132 | 9 | 14.6667 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
abr | 153 | 1 | 25 | 0.0400 |
feb | 143 | 1 | 24 | 0.0417 |
dentro | 2011 | 3 | 71 | 0.0423 |
Published | 461 | 1 | 22 | 0.0455 |
miles | 681 | 1 | 18 | 0.0556 |
dic | 84 | 1 | 16 | 0.0625 |
capaz | 331 | 1 | 16 | 0.0625 |
cientos | 337 | 1 | 15 | 0.0667 |
Categoría | 262 | 1 | 14 | 0.0714 |
oct | 73 | 1 | 13 | 0.0769 |
integrada | 178 | 1 | 13 | 0.0769 |
énfasis | 156 | 1 | 13 | 0.0769 |
nov | 69 | 1 | 13 | 0.0769 |
ene | 179 | 2 | 26 | 0.0769 |
capaces | 226 | 1 | 12 | 0.0833 |
indican | 216 | 1 | 11 | 0.0909 |
decenas | 263 | 1 | 11 | 0.0909 |
convertirse | 305 | 1 | 11 | 0.0909 |
Friday | 101 | 1 | 11 | 0.0909 |
millones | 6788 | 39 | 401 | 0.0973 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II